ATGAGCTATTACGGAAGTAGCTACCGGATTGTCAATGTGGACTCCAAATATCCAGGCTATCCTCC
AGAGCATGCCATCGCTGAGAAGAGAAGAGCAAGAAGGCGCTTGCTCCACAAAGATGGCAGCTGTA 
ATGTGTACTTTAAACACATTTTTGGAGAATGGGGGAGCTACATGGTTGATATTTTTACCACTCTT 
GTGGATACCAAGTGGCGCCATATGTTCATAATATTTTCTCTGTCTTACATTCTCTCCTGGTTGAT 
ATTTGGTTCCATATTTTGGCTCATAGCCTTTCATCACGGAGACCTATTAAGCGATCCAGATATCA 
CCCCTTGTGTTGACAACGTGCATTCATTTACGGCTGCATTTTTATTCTCCCTGGAGACCCAGACC 
ACCATTGGATACGGTTACCGCTGTGTCACCGAAGAGTGCTCTGTGGCTGTACTGACAGTGATCCT 
TCAGTCCATCCTCAGCTGCATCATAAACACCTTCATCATTGGAGCAGCCTTGGCAAAGATGGCAA 
CTGCCCGGAAGAGAGCCCAGACCATACGCTTCAGCTATTTTGCCCTCATTGGTATGAGAGACGGG 
AAGCCTTGCCTCATGTGGCGCATAGGTGACTTCCGACCAAACCATGTGGTAGAGGGCACGGTGAG 
AGCCCAACTTCTGCGCTATTCAGAAGACAGTGAAGGGAGGATGACGATGGCGTTTAAAGACCTCA 
AACTCGTCAATGACCAGATAATCCTGGTAACTCCAGTGACTATTGTCCATGAAATTGACCATGAG 
AGCCCTCTGTATGCCCTTGACCGCAAGGCAGTGGCCAAAGATAATTTCGAGATTCTGGTGACATT 
TATTTATACTGGTGATTCCACTGGGACATCCCACCAGTCCAGAAGTTCCTACATCCCCAGAGAAA 
TTCTCTGGGGCCACAGGTTTCATGATGTATTGGAAGTGAAGAGAAAGTACTACAAGGTGAACTGC 
TTGCAGTTTGAAGGAAGCGTGGAAGTCTACGCCCCCTTTTGCAGTGCCAAACAACTGGACTGGAA 
GGACCAACAACTCAACAACTTGGAGAAAACGTCCCCTGCCCGAGGATCCTGCAATTCTGACACCA 
ACACCAGGAGGCGGTCCTTCAGCGCAGTTGCCGTGGTGAGCAGCTGTGAGAACCCAGAGGAGACC 
GTCCTGTCCCCACAAGATGAATGTAAGGAGATGCCCTATCAGAAAGCCCTCCTGACTTTAAATAG 
GATCTCCATGGAATCCCAGATG (SEQ ID NO:1)
MSYYGSSYRIVNVDSKYPGYPPEHAIAEKRRARRRLLHKDGSCNVYFKHIFGEWGSYMVDIFTT

LVDTKWRHMFIIFSLSYILSWLIFGSIFWLIAFHHGDLLSDPDITPCVDNVHSFTAAFLFSLET 

QTTIGYGYRCVTEECSVAVLTVILQSILSCIINTFIIGAALAKMATARKRAQTIRFSYFALIGM 

RDGKPCLMWRIGDFRPNHVVEGTVRAQLLRYSEDSEGRMTMAFKDLKLVNDQIILVTPVTIVHE

IDHESPLYALDRKAVAKDNFEILVTFIYTGDSTGTSHQSRSSYIPREILWGHRFHDVLEVKRKY 

YKVNCLQFEGSVEVYAPFCSAKQLDWKDQQLNNLEKTSPARGSCNSDTNTRRRSFSAVAVVSSC

ENPEETVLSPQDECKEMPYQKALLTLNRISMESQM (SEQ ID NO:2)
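
The amino acid listing above appears to be the conceptual translation of SEQ ID NO:1 read in frame from its first ATG. A minimal sketch of that check, assuming the standard genetic code; the helper name `translate` is hypothetical and is not part of the original listing:

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 nucleotides of SEQ ID NO:1 as listed above.
print(translate("ATGAGCTATTACGGAAGTAGCTACCGGATT"))
```

Running the same function over the full nucleotide listing gives a residue-by-residue comparison against SEQ ID NO:2, which is useful for spotting characters that were misread in either sequence.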
underlined = deleted in targeting construct

BOLD = sequence flanking Neo insert in targeting construct 



ATGAGCTATTACGGAAGTAGCTACCGGATTGTCAATGTGGACTCCAAATATCCAGGCTAT 
CCTCCAGAGCATGCCATCGCTGAGAAGAGAAGAGCAAGAAGGCGCTTGCTCCACAAAGAT 
GGCAGCTGTAATGTGTACTTTAAACACATTTTTGGAGAATGGGGGAGCTACATGGTTGAT 

ATTTTTACCACTCTTGTGGATACCAAGTGGCGCCATAT GTTCATAATATTTTCTCTGTCT 
TACATTCTCTCCTGGTTGATATTTGGTTCCATATTTTGGCTCATAGCCTTTCATCACGGA 
GACCTATTAAGCGATCCAGATATCACCCCTTGTGTTGACAACGTGCATTCATTTACGGCT 
GCATTTTTATTCTCCCTGGAGACCCAGACCACCATTGGATACGGTTACCGCTGTGTCACC 
GAAGAGTGCTCTGTGGCTGTACTGACAGTGATCCTTCAGTCCATCCTCAGCTGCATCATA 
AACACCTTCATCATTGGAGCAGCCTTGGCAAAGATGGCAACTGCCCGGAAGAGAGCCCAG 
ACCATACGCTTCAGCTATTTTGCCCTCATTGGTATGAGAGACGGGAAGCCTTGCCTCATG 
TGGCGCATAGGTGACTTCCGACCAAACCATGTGGTAGAGGGCACGGTGAGAGCCCAACTT 
CTGCGCTATTCAGAAGACAGTGAAGGGAGGATGACGATGGCGTTTAAAGACCTCAAACTC 
GTCAATGACCAGATAATCCTGGTAACTCCAGTGACTATTGTCCATGAAATTGACCATGAG 
AGCCCTCTGTATGCCCTTGACCGCAAGGCAGTGGCCAAAGATAATTTCGAGATTCTGGTG 
ACATTTATTTATACTGGTGATTCCACTGGGACATCCCACCAGTCCAGAAGTTCCTACATC
CCCAGAGAAATTCTCTGGGGCCACAGGTTTCATGATGTATTGGAAGTGAAGAGAAAGTAC 
TACAAGGTGAACTGCTTGCAGTTTGAAGGAAGCGTGGAAGTCTACGCCCCCTTTTGCAGT 
GCCAAACAACTGGACTGGAAGGACCAACAACTCAACAACTTGGAGAAAACGTCCCCTGCC
CGAGGATCCTGCAATTCTGACACCAACACCAGGAGGCGGTCCTTCAGCGCAGTTGCCGTG 
GTGAGCAGCTGTGAGAACCCAGAGGAGACCGTCCTGTCCCCACAAGATGAATGTAAGGAG
ATGCCCTATCAGAAAGCCCTCCTGACTTTAAATAGGATCTCCATGGAATCCCAGATG
Gene Sequence Structure

Size of partial cDNA: 1257 bp

Sequence Deleted (figure labels): 219 bp, 419 bp

Targeting Vector* (genomic sequence): LacZ-Neo

Arm Length:

5': 1.5 kb
3': 3.5 kb

Endogenous Locus

* Not drawn to scale


5'>AGCTACAGGATCGTCAATGTG
GACTCCAAATATCCAGGCTATCCT
CCAGAGCATGCCATCGCTGAGAAG
AGAAGAGCAAGAAGGCGCTTGCTC
CACAAAGATGGCAGCTGTAATGTG
TACTTTAAACACATTTTTGGAGAA
TGGGGGAGCTACATGGTTGATATT
TTTACCACTCTTGTGGATACCAAG
TGGCGCCATATO 1 
(SEQ ID NO:3)



5'>CGAAGAGTGCTCTGTGGCTGT
ACTGACAGTGATCCTTCAGTCCAT
CCTCAGCTGCATCATAAACACCTT
CATCATTGGAGCAGCCTTGGCAAA
GATGGCAACTGCCCGGAAGAGAGC
CCAGACCATACGCTTCAGCTATTT
TGCCCTCATTGGTATGAGAGACGG
GAAGCTTTGCCTCATGTGGCGCAT
AGGTGACTTCC<3'
(SEQ ID NO:4)